A Transformation to Convert Packing Code to Compact Datatypes for Efficient Zero-Copy Data Transfer

نویسندگان

  • Fredrik Kjolstad
  • Torsten Hoefler
  • Marc Snir
چکیده

Many high performance applications spend considerable timepacking data into contiguous communication buffers. Datatypesprovide an alternative by describing the layout of the communi-cated data. This empowers the runtime system to retrieve non-contiguous elements directly from application data structures.However, programmers find complex datatypes hard to use andare reluctant to invest time and effort to rewrite packing code todatatype code. Fortunately, the transformation from packing codeto datatypes can be automated, and the programmer can replacepacking code with datatypes at the push of a button. The transfor-mation allows easy porting of applications to new machines thatbenefit from datatypes, thus improving programmer productivity.We present an algorithm for converting packing code to datatypecode based on a novel IR and a suite of optimizations. We have im-plemented the algorithm in a tool that transforms C packing codeto an MPI datatype, and rewrites the packing code consumer toinstead use the datatype. Our evaluation shows that our algorithmis applicable to real-world packing code, that it is fast enough to beused interactively, and that the datatypes it produces are compactand well optimized. Finally, we evaluate the performance of thecode produced by our tool, showing that it outperforms the orig-inal packing code on a state-of-the art system. However, the realbenefit of datatypes is in the future, when hardware support fornon-contiguous gather-scatter transfers becomes ubiquitous.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient Using MPI Datatypes

Many parallel applications need to communicate noncontiguous data. Most applications manually copy (pack/unpack) data before communications even though MPI allows a zero-copy specification. In this work, we study two complex use-cases: (1) Fast Fourier Transformation where we express a local memory transpose as part of the datatype, and (2) a conjugate gradient solver with a checkerboard layout...

متن کامل

Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost

The MPI Standard supports derived datatypes, which allow users to describe noncontiguous memory layout and communicate noncontiguous data with a single communication function. This feature enables an MPI implementation to optimize the transfer of noncontiguous data. In practice, however, few MPI implementations implement derived datatypes in a way that performs better than what the user can ach...

متن کامل

Comaprison of Mass-Transfer Efficiencies of Scc and Structured Packing

In a previous paper, pressure drop, flooding and mass-transfer characteristics of a novel pilot-scale distillation column called spinning cone column (SCC) were presented. Here, we present the result of comparison of mass-transfer efficiencies of SCC and structured packing. Comparison of SCC and structured packing mass-transfer characteristics show that the gas and liquid-side height of transfe...

متن کامل

Automatic Memory Optimizations for Improving MPI Derived Datatype Performance

MPI derived datatypes allow users to describe noncontiguous memory layout and communicate noncontiguous data with a single communication function. This powerful feature provides an MPI implementation with optimized transfer of noncontiguous data. In practice, however, many implementations of MPI derived datatypes perform poorly, which makes application developers avoid using this feature. In th...

متن کامل

An alternative transformation in ranking using l1-norm in data envelopment analysis

Jahanshahloo et al. (Appl Math Comput 153:215–224, 2004) propose a method for ranking extremely efficient decision making units (DMUs) in data envelopment analysis (DEA) using super-efficiency technique and l1-norm and they show that the presented method is able to eliminate the existing difficulties in some methods. This paper suggests an alternative transformation to convert the nonlinear mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011